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CONSTRUCTION OF UNI-DIRECTIONALLY CLONED cDNA LIBRARIES FROM 
MESSENGER RNA FOR IMPROVED 3' END DNA SEQUENCING 

FIELD OF INVENTION 

The present invention relates generally to the field of amplifying nucleic acids, more 
particularly to methods for producing cDNA from mRNA, sequencing DNA, and 
constructing cDNA libraries. 

BACKGROUND OF THE INVENTION 

The characterization of cell specific gene expression finds application in a 
variety of disciplines, such as in the analysis of differential expression between different 
tissue types, different stages of cellular growth or between normal and diseased states, and the 
like. Fundamental to the characterization of cell specific gene expression is the detection of 
mRNA, and the construction of comprehensive cDNA libraries. However, the detection of 
mRNA is often complicated by one or more of the following factors: cell heterogeneity, 
paucity of material, or limits of low abundance mRNA detection. 

In a general method of constructing cDNA libraries, polyA mRNA is prepared from 
the desired cells and the first strand of the cDNA is prepared from the poIyA mRNA using a 
RNA-dependent DNA polymerase ("reverse transcriptase") and an oligodeoxynucleotide 
primer of 12 to 18 thymidine residues. In another method, the primer contains one or two 
nucleotides at one end that can hybridize to the mRNA sequence upstream of the poly A tail. 
Usually, the first polyA-non-compIementary nucleotide is a deoxyadenylate, deoxyguanylate, 
or deoxycytidylate ("dC"), and the second nucleotide can be any deoxynucleotide. The use of 
2 nucleotides can provide a moreaccurate positioning of the primer at the junction between 
mRNA and the polyA tail. 

The second strand of the cDNA is synthesized by one of several methods, the more 
efficient of which are commonly known as "replacement synthesis" and "primed synthesis." 
Replacement synthesis involves the use of ribonuclease H ("RNAase H"), which cleaves the 
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phosphodiester backbone of RNA that is in a RNA:DNA hybrid leaving a 3' hydroxyl and a 5' 
phosphate, to produce nicks and gaps in the mRNA strand, creating a series of RNA primers 
that are used by E. coli DNA polymerase I, or its "Klenow" fragment, to synthesize the 
second strand of the cDNA. This reaction is very efficient; however, the cDNAs produced 
most often lack the 5' terminus of the mRNA sequence. 

Primed synthesis to generate the second cDNA strand is a general name for several 
methods which are more difficult than replacement synthesis yet clone the 5' terminal 
sequences with high efficiency. In general, after the synthesis of the first cDNA strand, the 3' 
end of the cDNA strand is extended with terminal transferase, an enzyme which adds a 
homopolymeric "tail" of deoxynucleotides, most commonly deoxycytidylate. This tail is then 
hybridized to a primer of oligodeoxyguanidylate or a synthetic fragment of DNA with an 
deoxyguanidyiate tail and the second strand of the cDNA is synthesized using a 
DNA-dependent DNA polymerase. 

Once both cDNA strands have been synthesized, the cDNA library is constructed by 
cloning the cDNAs into an appropriate plasmid or viral vector. In practice this can be done 
by direcdy ligating the blunt ends of the cDNAs into a vector which has been digested by a 
restriction endonuclease to produce blunt ends. Blunt end ligations are very inefficient, 
however, and this is not a common method of choice. A generally used method involves 
adding synthetic linkers or adapters containing restriction endonuclease recognition 
sequences to the ends of the cDNAs. The cDNAs can then be cloned into the desired vector at 
a greater efficiency. 

One potentifd problem with the current method of constructing cDNA libraries is that 
the hybridization of the oligo dT primer to the poiyA tail of the mRNA in the initial step is 
not perfect. The primer does not necessarily accurately position at the junction between the 
mRNA and its polyA tail. Therefore, there may be continuous stretches of T's in addition to 
the T's on the first strand primer. While this does not usually affect efficiencies in 
sequencing from the 5' end, it severly compromises the ability to obtain accurate and 
successful sequencing from the 3" (poiyA tail) end. Thus, there exists a need for methods and 
procedures of cDNA synthesis and cloning. 
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SUMMARY OF THE INVENTION 

Methods are provided for obtaining a DNA complementary to a mRNA by contacting 
the mRNA having a polyadenosine (poly A) tail with a primer mixture, where each primer in 
the mixture comprises at least 5 contiguous deoxythymidines and at least 2 independently 
selected non-deoxythymidine nucleotides near one end, and reverse transcribing the mRNA 
using a reverse transcriptase to produce a DNA strand complementary to the mRNA. 

Methods are also provided for obtaining a DNA complementary to a mRNA by 
contacting the mRNA having a polyA tail with a primer mixture, where each primer in the 
mixture comprises at least 10 contiguous deoxythymidines and a non-poly A-complementary 
region near one end, and reverse transcribing the mRNA using a reverse transcriptase to 
produce a DNA strand complementary to the mRNA. The non-poly A-complementary region 
is selected from the group consisting of 3'-VV, 3'-VTV, 3'-VTVV, 3'-VTVW, 3'-VTVVTV, 
3'-VTTV, 3'-VTTTV, 3'-VVTVW, and 3'-VWVV, and combinations thereof, wherein V is 
deoxyadenosine, deoxycytidine, or deoxyguanosine, and the primer mixture may contain 
primers that are sense, anti-sense, or double stranded, and may contain a double stranded 
restriction enzyme sequence near the end opposite to the one containing the non- 
deoxythymidine nucleotides. 

Methods are also provided for producing uni-directionally cloned complimentary 
DNA libraries from mRNA by contacting the mRNA having polyadenylated tails with a 
primer mixture, wherein each primer in the mixmre has at least 10 contiguous 
deoxythymidines and at least two non-deoxythymidine nucleotides near one end and a double 
stranded restriction enzyme sequence at the opposite end, reverse transcribing the mRNA 
using a reverse transcriptase to produce a DNA strand complementary to the mRNA, 
modifying the complementary DNA strand wherein the polyT tail is substantially removed, 
and amplifying the modified cDNA strand by inserting the strand into a cloning vector uni- 
directionally, and amplifying using a DNA polymerase. 

Methods are also provided for producing uni-directionally cloned complimentary 
DNA libraries from mRNA by contacting the mRNA having a polyA tail with a primer 
mixture wherein each primer in the mixture has at least 15 contiguous deoxythymidines 
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having a restriction enzyme site at one end and a non-poly A-complementary region near the 
opposite end, wherein the non-poly A-complementary region is selected from the group 
consisting of 3'-VV, 3'-VTV, 3'-VTW, 3'-VTWV, 3'-VTVVTV, 3'-VTTV, 3'-VTTTV, 3'- 
VVTVVV, and 3'-VVVW, and combinations thereof, wherein V is deoxyadenosine, 
deoxycytidine, or deoxyguanosine, reverse transcribing the mRNA using a reverse 
transcriptase to produce a cDNA strand having a poiyT tail, modifying the cDNA strand 
wherein the polyT tail is substantially removed, and amplifying the modified cDNA strand by 
inserting the strand into cloning vector uni-directionaily, and amplifying using a DNA 
polymerase. The primer mixture may contain primers that are sense, anti-sense, or dotibie ■ 
stranded, and may contain a restriction enzyme site near the end opposite to the one 
containing the non-deoxythymidine nucleotides. 

These ^d other objections, advantages, and features of the invention will becon% 
apparent to those persons skilled in the art upon reading the details of the invention as more 
fully described below. 

BRffiF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides a schematic representation the mixture of primers. 

DETAILED DESCRIPTION 

Before the present methods and kits are described, it is to be understood that this 
invention is not limited to particular methods and kits described, as such may, of course, vary. 
It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to be limiting, since the scope of the present 
invention will be limited only by the appended claims. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as conraionly understood by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or equivalent to those described herein 
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can be used in the practice or testing of the present invention, the preferred methods and 
materials are now described. All publications mentioned herein are incorporated herein by 
reference to disclose and describe the methods and/or materials in connection with which the 
publications are cited. 

DEFINITIONS 

It must be noted that, as used in the specification and the appended claims, the 
singular forms "a," "an" and "the" include plural referents unless the context clearly dictates 
otherwise. Thus, for example, reference to "a primer" means that more than one such primer 
can be present in the composition. 

The term "primers" are short nucleic acids, whether occurring naturally as in a purified 
restriction digest or produced synthetically, usually DNA oligonucleotides, which may be 
annealed to a target polynucleotide by complementary base-pairing. The primer may then be 
extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for 
amplification (and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction 
(PGR). 

The term "substantially pure" means an object species is the predominant species 
present (i.e., on a molar basis it is more abundant than any other individual macromolecular 
species in the composition), and preferably a substantially purified fraction is a composition 
wherein the object species comprises at least about 50 percent (on a molar basis) of all 
macromolecular species-present. Generally, a substantially pure composition will comprise 
more than about 80 to 90 percent of all macromolecular species present in the composition. 

The terms "complementary" and "complementarity" refer to the natural binding of 
polynucleotides by hydrogen bond base pairing. For example, the sequence "5' A-G-T 3'" 
with the complementary sequence "3' T-C-A 5'." Complementarity between two single- 
stranded molecules may be "partial," such that only some of the nucleic acids bind, or it may 
be "complete," such that total complementarity exists between the single stranded molecules. 
The degree of complementarity between nucleic acid strands has significant effects on the 
efficiency and strength of the hybridization between the nucleic acid strands. This is of 
particular importance in amplification reactions 
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Methods for PCR amplification are described in the art (PCR Technology: Principles 
and Applications for DNA Amplification ed. H A Erlich, Stockton Press, New York, N.Y. 
(1989); PCR Protocols: A Guide to Methods and Applications, eds. Innis, Gelfland, Snisky, 
and White, Academic Press, San Diego, Calif. (1990); Mattila et al. (1991) Nucleic Acids 
Res. 19: 4967; Eckert, K. A. and Kunkel, T. A. (1991) PCR Methods and Applications 1: 17 
each of which are incorporated herein by reference). 

' In one embodiment, cDNA corresponding to mRNA is synthesized by first contacting 
the mRNA having a polyA tail with a mixture of primers. The primer mixture contains 
primers that have at least 10 contiguous deoxy thymidines, a restriction enzyme site near one 
end and a non-poly A-complementary region near the other end, where the non-poly A- 
complementary region is -VV, -VTV, -VTTV, -VTTTV, and -VWW. The resultant cDNA 
is modified such that the polyT tail is substantially removed, and subsequently ligated into a 
vector for the synthesis of cDNA libraries. 

Primers as used in the present invention typically comprise at least 5 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers 
may also be employed, such as probes and primers that comprise at least 10, 13, 15, 17, 19, 
20, 25, 30, 40, 50, 60, 70, 80, 90, 100, or at least 150 nucleotides. Probes and primers may be 
considerably longer than these examples, and it is understood that any length may be used. 

In one embodiment of the invention, the mixture of primers contains a first primer that 
has at least 5 nucleotides capable of hybridizing to the poly A tail and at least one nucleotide 
near one end, preferably the 3'-end, that can hybridize to an mRNA sequence that is 
immediately upstream of the polyA tail, and contains, and at least a second primer that has at 
least 5 nucleotides capable of hybridizing to the polyA tail and at least two nucleotide near 
one end, preferably the 3*-end, that can hybridize to an mRNA sequence that is immediately 
upstream of the polyA tail. Thus, the mixture may contain primers having the sequences 5'- 
TTTTTV (Seq. ED. No. 1), 5'-TTTTTVV (Seq. ID. No. 2), 5'-TTTTTVTV (Seq. ID. No. 3), 
S'-TTTTTVTTV (Seq. ID. No. 4), 5'-TTTTTVTTTV (Seq. ID. No. 5), and the like where V 
is deoxyadenylate ("dA"), deoxyguanylate ("dO"), or deoxyeytidylate ("dC"). The mixture 
may contain the primers, such as in Seq. ID. No. 1 through 5, in any ratio. 

In some embodiments, the mixture may contain primers having 1, 2, or 3 nucleotides 
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near the 3 '-end that can hybridize to an mRNA sequence that is immediately upstream of the 
poIyA tail. Thus, the mixture may contain, in addition to primers of Seq. ID. Nos. i-5 above, 
primers having the sequences 5'-TTTTTVVV (Seq. ED. No. 6), 5'-TTTTTVTVV (Seq. ID. 
No. 7), 5'-TTTTTVVTV (Seq. ID. No. 8), 5'-TTTTTVTVTV (Seq. ID. No. 9), and the like in 
ratio. The addition of each nucleotide near the 3'-end will further increase the stability of 
properly aligned hybrids. In general, the sequence to hybridize to the polyA tail can be 
decreased by one nucleotide for each additional non-poly A-complementary nucleotide added. 

In some embodiments, the mixture may contain primers having more than 3 
n'ueleotides near the 3'-end that can hybridize to the mRNA sequence that is immediately 
upstream of the poly A tail, including primer having the sequence 5'-TTTTTVVVVV (Seq. 
ID. No. 10). The 3 or more nucleotides near the 3'-end may be in present in any possible 
combination, and the mixture may contain the primers in any possible ratio of concentrations. 
In the most preferred embodiment, the primer mixmre contains primers of Seq. ID. No. 2, 3, 
4, 5, and 10. The mixture preferably contains Seq. ID. No. 2 at a concentration of about 2 % 
to about 30 %, preferably about 10 % to about 25 %, most preferably about 15 % to about 20 
% of the total concenoration of the primers; Seq. ID. No. 3 at a concentration of about 0.5 % 
to about 10 %, preferably about 2 % to about 8 %, most preferably about 3 % to about 6 % of 
the total concentration of the primers; Seq. ID. No. 4 at a concentration of about 0.05 % to 
about 10 % preferably about 0. 1 % to about 5 %, most preferably about 0.5 % to about 3 % of 
the total concentration of the primers; Seq. ID. No. 5 at a concentration of about 0.005 % to 
about 1 % preferably about 0.01 % to about 0.5 % of, most preferably about 0.005 % to about 
0.05 % the total concentration of the primers; and Seq. ID. No. 10 at a concenUration of upto 
about 95 % , preferably about 60 % to about 80 % of the total concentration of the primers. 
The resultant mixture of primers may be used in a single reaction for the priming of the 
mRNA for reverse transcription. 

In the preferred embodiment, each of the primers in the mixture contains 10 to 20 
nucleotides that are capable of hybridizing to the polyA tail, 2 or more nucleotides at one end 
that can hybridize to the mRNA sequence that is inunediately upstream of the poly A tail, and 
a restriction endonuclease recognition sequence at the other end. Preferably, the restriction 
enzyme sequence is the same for all the primers in the mixture, and is double stranded. The 
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endonuclease recognition sequence can be any, as is known in the art, and may include Bpal. 

The polynucleotide primers may be prepared using any suitable method, such as, for 
example, the phosphotriester and phosphodiester methods, or automated embodiments 
thereof. In one such automated embodiment diethylphosphoraraidites are used as starting 
materials and may be synthesized as described by Beaucage et al. (1981) Tetrahedron Letters 
' 22: 1859. One method for synthesizing oligonucleotides on a modified solid support is 
described in U.S. Pat. No. 4,458,066. It is also possible to use a primer which has been 
isolated from a biological source, such as a restriction endonuclease digest or the like. 
Methods for preparing and using probes and primers are described in the refe^nces, for 
example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual . 2"** ed., vol. 1-3, 
Cold Spring Harbor Press, Piainview NY; Ausubel et al.,1987. Current Protocols in 
Molecular Biology . Greene Pubi. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 
1990, PCR Protocols. A Guide to Methods and Applications . Academic Press, San Diego 
CA. PCR primer pairs can be derived from a known sequence, for example, by using 
computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead 
Instimte for Biomedical Research, Cambridge MA). 

The mixture of primers may consist of both the sense and the antisense complement. 
The sense and antisense mixmre of primers may be selected as described in more detail 
above, and may be present in any stoichiometry, but are generally present in approximately 
equimoiar concentrations in the reaction fluid. Thus, double stranded mixture of primers may 
be used. In addition, the sense and antisense sequences can be covalently attached using a 
variety of methods. In one method, the two strands can be linked as a hairpin. Additionally, 
the strands can be synthesized as a double hairpin and/or ligated to form a closed dumbell 
(Annu. Rev. Biophys, Biomol. Struct., 25, 1-28, (1996)). Other methods include covalenUy 
coupling the sense and antisense strands using standard chemical approaches. One such 
method of coupling incorporates linker-arms in the complementary strand, where the linker- 
arm is modified to react primarily with a functional moiety. 

The use of die mixture of primers described in more detail above allows for the 
synthesis of cDNA to begin accurately at the start of the polyA tail of the gene, even in 
instances where the poly A tail is imperfect in that it may contain non-A nucleotides. The art 
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method of preparing cDNA from mRNA usually result in about 60% successful reads when 
sequencing from the 3' (poly A tail) end is attempted. In contrast, the use of the mixture of 
primers of the present invention results in greater than 80% successful reads. 

The cDNA prepared by the use of the mixture of primers described in more detail 
above, may be subsequently modified where the stretch of cDNA corresponding to the 
mRNA poly- A tail is substantially removed. The removal of the dT stretch of cDNA can be 
by any means, such as chemical means, enzymatic means, or other means known in the art, 
and the entire polyT tail may be removed, or a substantial portion, of it may be removed. In 
the preferred embodiment, the restriction endonuclease recognition sequence is used to 
remove the dT stretch upstream of the site, wherein at least 70 % of the dT nucleotides are 
removed. 

The modified cDNA can then be amplified by adding a second known primer, by 
adding a random primer, or by inserting it unidirectionally into a cloning vector. 
Oligonucleotides for use as primers may be selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PGR primer pairs 
of up to 100 nucleotides each, and for the analysis of oligonucleotides and larger 
polynucleotides of up to 5,000 nucleotides from an input polynucleotide sequence of up to 32 
kilobases. Similar primer selection programs have incorporated additional features for 
expanded capabilities. For example, the PrimOU primer selection program (available to the 
public from the Genome Center at University of Texas South West Medical Center, Dallas 
TX) is capable of choosing specific primers from megabase sequences and is thus useful for 
designing primers on a genome-wide scope. The Primer3 primer selection program 
(available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "mispriming library." in which sequences to avoid 
as primer binding sites are user-specified. PrimerS is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection 
programs may also be obtained from their respective sources and modified to meet the user's 
specific needs.) The PrimeGen program (available to the public from die UK Human 
Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on 
multiple sequence alignments, thereby allowing selection of primers that hybridize to either 
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the most conserved or least conserved regions of aligned nucleic acid sequences. Hence, this 
program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by 
any of the above selection methods are useful in hybridization technologies, for example, as 
PCR or sequencing primers, microarray elements, or specific probes to identify fully or 
partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

In the preferred embodiment, the modified cDNA sequence is inserted into a cloning 
vector uni-directionally by methods known in the art. Gubler, U and Hoffman, B.J. (1983) 
" A simple and very efficient method for generating cDNA libraries" Gene 25:263.; 
Coleciough, C. and Eriitz.F. (1985) "Use of primer restriction end adaptors in a novel cDNA 
cloning strategy" Gene. 34:305. The cDNA libraries may be constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. For most libraries, the cDNA 
may be size-selected (300-1000 bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B, or 
SEPHAROSE CL4B column chromatography ( Amersham Pharmacia Biotech) or preparative 
agarose gel electrophoresis. cDNAs may be ligated into compatible restriction enzyme sites 
of the polylinker of a suitable plasmid, e.g., PBLUESCRDPT plasmid (Stratagene), PSPORTl 
plasmid (Life Technologies), PCDNA2. 1 plasmid (Invitrogen, Carlsbad CA), or pINCY 
(Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids may be transformed into 
competent E. coli cells including XLl-Blue, XLl-BIueMRF, or SOLR from Stratagene or 
DH5a, DHIOB, or ElectroMAX DHIOB from Life Technologies. 

The cDNA libraries can be used to determine quantitative information about the 
genetic profile of the nucleic acids in the sample that was contacted with the array to generate 
the hybridization pattern, as well as the physiological source from which the labeled sample 
nucleic acid was derived. The data provides information about the physiological source from 
which the sample nucleic acid were derived, such as the types of genes expressed in the tissue 
or cell which is the physiological source, as well as the levels of expression of each gene, 
particularly in quantitative terms. 

The present method can be used in comparing nucleic acid samples from two or more 
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physiologicai sources to identify and quantify differences between the patterns thereby 
providing data on the differential expression of a particular gene in the physiologicai sources 
being compared. Thus the methods of the invention find use in differential gene expression 
assays for the analysis of a diseased and normal tissue, analysis of a different tissue or 
subtissue types, and the like. Thus, this data may be used for large scale correlation studies 
on the sequences, mutations, variants, or polymorphisms among samples. 

EXAMPLES 

The following examples are put forth so as to provide those of ordinary skill in the art 
with a complete disclosure and description of how to make and use the present invention, and 
are not intended to limit the scope of what the inventors regard as their invention nor are they 
intended to represent that the experiments below are all or the only experiments performed. 
Efforts have been made to ensure accuracy with respect to numbers used (e.g. amounts, 
temperatures, etc.) but some experimental errors and devi^ons should be accounted for. 
Unless indicated otherwise, parts are parts by weight, molecular weight is weight averse 
molecular weight, temperature is in degrees Centigrade, and pressure is at or near 
atmospheric. 

EXAMPLE 1 

This example serves to demonstrate a method of making the mixture of primers for 
use in the invention. 

The primers having the sequences of Seq I.D. No. i 1 through i5 16 (below), also 
shown in Figure 1, were made by synthetic methods. 



5'pGACTAGTrCTAGATCGCGACTGGA'rrrJ 


ITriTlTri 


n-rrriTW 


Seq ID. No. 


11 


5'pGACTAGTrCTAGATCGCGACTGGA'lTri 


ri'rrrriTi 


rriTrnvTV 


Seq ID. No. 


12 


5'pGACTAGTTCTAGATCGCGACTGGA riTJ 


iTrrrri'i'i 


riTnTrvnv 


Seq ID. No. 


13 


5'pGACTAGTTCTAGATCGCGACTGGArri'J 


I'lTrnrn 


rrn-rrrvnTv 


Seq ID. No. 


14 


5'pGACTAGTTCTAGATCGCGACTGGArn"J 


LTiTiTrn 


rrrrrnvvvvv 


Seq ED. No. 


15 
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The primer mixture made by mixing together about 18.85% of the double stranded primer 
having Seq I.D. No. 11, about 4.69% of the double stranded primer having Seq I.D. No. 12 , 
about 1.17% of the double stranded primer having Seq I.D. No. 13, about 0.029% of the 
double stranded primer having Seq LD. No. 14 , and about 75.0% of the double stranded 
primer having Seq I.D. No. 15. 

EXAMPLE 2 

This example serves to demonstrate a method for the synthesis of the first strand. 

mRNA was synthesized or isolated from tissue samples. The tissue samples were 
homogenized and iysed in guanidinium isothiocyanate, or they were homogenized and iysed 
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were 
centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated from the 
lysates with either isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase 
RNA purity. In some cases, RNA was treated with DNase. For most libraries, poiy(A+) 
RNA was isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX 
latex particles (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit 
(QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other RNA 
isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

The mRNA (1.5 pg) was resuspended in 10 yl of DEPC treated water, denatured by 
heating the RNA to about 70 °C for 10 min., and then quick-chilled on ice. Reverse 
transcription was initiated using the primer mixture of Example I (0.5 jag in 1 |il of water), 
heating the mixture to about 70 °C for 10 min., and then quick-chill on ice. The mixtiu-e was 
briefly centrifuged, and the solid contents at the bottom of the tube were collected. The 
contents were then cooled on ice, and 4 ml 5X first strand buffer, 2 ml 0.1 M DTT, 1 ml 10 
mM dNTP (with methyl dCTP) mix, and 1 ml a-[^¥]dCTP (10 uCi/ul) were added and the 
contents were mixed by gently vortexing. The reaction mixture was briefly centrifuged and 
the solid contents at the bottom were collected. To the solid contents were added 1 |il of 
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deletion mut_MMLV-RT (200 ^/jii) from Promega or Superscript I, and the total reaction 
volume was about 20 fil. Other reverse transcriptases, point mutations or native should not be 
substituted. The reaction mixture was gently mixed and incubated at about 37 "C for 
approximately 1 h. and then incubated at 70 °C for approximately 10 min., followed by quick 
chill on ice. 

EXAMPLES 

This example serves to demonstrate a method for the synthesis of the second strand. 

The reaction tube from Example 2, containing the first strand was placed on ice, and 
92 |il of DEPC-treated water, 30 pi of 5X second strand buffer, 3 fil of 10 mM dNTP mixture, 
1 ul 1 00 mM dCTP, 1 pi of E. coli ligase ( 10 U/pi), 4 pi of E. coli DNA polymerase 1(10 
U/|il), and I pi of E.coii RnaseH (2 U/pi) were added. The reaction mixture was gently 
mixed, incubated at 16 °C for 2 h, followed by the addition of 2 pi of DNA polymerase with 
further incubation at 16 "C for 5 min. The sample was then partitioned into approximately 
two equal parts and each was size selected using a Chromospin CR-1000 colunm (Clontech). 
Next, an equal volume of phenol:chloroform:isoamyI alcohol (25:24:1) was added, and the 
reaction mixmre was stirred by vortexing, and the two phases were separated by 
centrifiigation at room temperature for 5 min. at 14,000 x g. The aqueous layer was removed 
and transferred to a new tube. To the aqueous layer were added about Yt volume of 7.5 M 
NH4OAC, I pi of glycogen (20 pg) followed by 2.5 volumes of 100% ethanol. The reaction 
mixture was stirred by vortexing followed by centrifugation at 14,000 x g for 15 min. at room 
temperature. The supernatant was careftilly removed, and the pellet rinsed twice with 200 pi 
of cold 70% ethanol by centrifugation at 14,000 x g for 2 min. The ethanol was removed, and 
the pellet was resuspended in 34 pi of water, and 10 pi of 5X T4 ligase buffer, 1 pi of EcoRl- 
Xhol adapters (1 pg/pl; Stratagene) and 5 pi of T4 DNA ligase were added. The final volume 
of the reactants was about 50 pi. The mixture was gently mixed and incubated overnight at 
16 °C. The next day, the about 150 pi of DEPC water was added, and an equal volume of 
phenol:chloroform:isoamyl alcohol (25:24:1) was added, and the reaction mixmre was stirred 
by vortexing, and the two phases were separated by centrifugation at room temperature for 5 
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min. at 14,000 x g. The aqueous layer was removed and transferred to a new 1.5 ml tube. To 
the aqueous layer were added about Vz volume of 7.5 M NH^OAc, 1 fil of glycogen (20 ^g) 
followed by 2.5 volumes of 100% ethanol. The reaction mixture was stirred by vortexing 
followed by centrifugation at 14,000.x g for 15 min. at room temperature. The supernatant 
was carefully removed, and the pellet rinsed twice with 200 |il of cold 70% ethanol by 
centrifugation at 14,000 x g for 2 min. The ethanol was removed by air drying the pellet for 
5-10 min, and the pellet was resuspended in 40.5 yl of DEPC water for digestion with NE& 
Bpm I restriction enzyme (NEB). 

EXAMPLE 4 

This example serves to demonstrate the method used to shorten the poIyA tail-primer 
complement in the cDNA to generate a 3' -A-A overhang. 

To die solution from Example 3 were added 5 jil of lOX buffer 3, 0.5 ^I of lOOX BSA 
and 4 |il of Bpm I (NEB). The mixture was mixed gendy, and incubated for at least 4-5 h at 
37 °C. Then 150 |id of DEPC water was added, and an equal volume of 
phenol:chloroform:isoamyl alcohol (25:24: 1) was added, and the reaction mixture was stirred 
by vortexing, and die two phases were separated by centrifugation at room temperature for 5 
min. at 14,000 x g. The aqueous layer was removed and transferred to a new 1.5 ml mbe. To 
the aqueous layer were added about '/a volume of 7.5 M NH4OAC, 3.5 |il of yeast tRNA (1 
jig/nl) followed by 2.5 volumes of 100% ethanol. The reaction mixmre was stirred by 
vortexing followed by centrifugation at 14,000 x g for 15 min. at room temperature. The 
supernatant was carefully removed, and the pellet rinsed twice with 200 jil of cold 70% 
ethanol by centrifugation at 14,000 x g for 2 min. The pellet was air dried bor 5-10 min, and 
re-suspended in 70 ^l of STB. 

EXAMPLE 5 

This example serves to demonstrate the ligation of cDNA into KS+/BsgI 
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The cDNA of Example 4 was size selected using a sepharose column. The vector was 
prepared by known methods by cutting sequentially with Bsgl overnight, then with EcoRI- 
and finally with BamiHI, The cut vector was purified by gel chromotography. 

Into a 1.5 ml micro-centrifuge tube were placed 2 |nl 5X DNA ligase buffer, 1 \d (25 
ng/^J) of the cut vector, 10 ng of the purified cDNA, and water to a total volume of 9 jil. To 
the mixture was then added 1 jii of T4 DNA ligase. The reaction mixture was mixed by 
gently vortexing, and the contents were quick-spun to the bottom of the tube, and incubated at 
room temperature for 2 h, or overnight at 16 °C. 



EXAMPLE 6 

This example presents the results from DNA sequencing. 

Out of the 576 clones that were attempted in cycle sequencing for three rat libraries, 
usable sequences were obtained from 503 clones (87.3 % success rate). The average usable 
length was 588 base pairs with an average of 563 bases with Phred 20 or higher. There was 
only trace to non-detectable levels of contamination with ribosomal, mitochondrial or clones 
with poly A/T stretches. When a comparison against Rat EST database was performed, a 
very high confirmation rate was obtained between the sequences using the method of the 
present method and ESTs produced by the TN-wobble primer based sequencing. In total 269 
pairs of sequences were analyzed. A significant number of clones produced 3' extensions to 
our database, and only 8 clones were more than 50 base pairs 5' of the database entry. 
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